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VLIW PROCESSOR WITH DATA SPILLING MEANS 



TECHNICAL FIELD 

The present invention relates to a VLIW processor comprising: a plurality of 
functional units; a distributed register file conq)rising a plurality of segments, the distributed 
register file being accessible by the functional units; a communication means for 
5 communication with a memory; a communication network for coupling the fimctional \mits 
and the distributed register file. 

BACKGROUND ART 

An embodiment of the VLIW processor of the kind set forth is disclosed in 
10 WO 00/33178. 

A Very Long Instraction Word (VLIW) processor allows exploiting 
instraction-level parallelism in programs and thus executing more than one instruction at a 
time, A VLIW processor uses multiple, independent fimctional units to execute multiple 
operations in parallel. VLIW processors package multiple fimctional imit operations into one 

15 very long instruction. 

Limitations of VLIW processing include limited hardware resources, Umited 
parallelism and a large increase in code size. Limited hardware resources may be the 
fimctional units, the register file or the communication network. Anticipating these 
limitations by adding more resources has some serious drawbacks. For example, when 

20 increasing the number of fimctional tmits, the memory size and register file bandwidth will 
have to increase as well. Furthermore, a large number of read and write ports are necessary 
for accessing the register file, imposing a bandwidth that is difficult to support without a 
large cost in the size of the register file and degradation in clock speed. 

For some applications to be run on the VLIW processor the limiting hardware 

25 resource may be the register file. A large register file could be used to preveat this. However, 
this has several drawbacks. First, a register file with many registers may create critical timing 
paths and ttierefore limit the cycle time of tihe processor. Second, as the number of directly 
addressable registers increases, the number of bits to specify the multiple registers within the 
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instruction increases as well Third, a register file with many registers occupies a large silicon 
area. 

DISCLOSURE OF INVENTION 
5 An object of the invention is to provide a VLIW processor architecture, which 

prevents the register file being the limiting hardware resource. 

This object is achieved with a VLlW processor of the kind set forth 
characterized in that the VLIW processor furthw comprises spilling means for transferring 
data between the distributed register file and the commnnication means. 
10 It is to be noted that spilling itself is known, but fix)m a completely difDsrent 

field of technology, viz. fi-om compiler technology. The principle is that when a compiler is 
generating machine code and there are more Uve variables than the machine has registers, 
some variables are transferred or "spilled" &om registers to memory. 

The spilling means have the capability to transfer values firom the register file 
15 to the communication means, via the functional units. The spilling means also have the 
capability to restore values by transferring them firom the communication means to the 
register file. 

An embodiment of the invention is characterized in that the spilUng means 
comprise a spill register file and a spill unit, the spill register file being accessible by the spill 
20 unit. An advantage of this embodiment is that it prevents that the speed of transferring values 
by the spilling means to the communication means limits the clock firequency of the 
processor. 

An embodiment of the invention is characterized iu that at least one of the 
segments of the distributed register file is associated with a fimctional unit that is imable to 
25 pass data firom the segment of the distributed register file to the spilling means, a pass unit 
being associated with the fimctional unit for passing data firom the segment of the distributed 
register file to the spilling means. In case a fimctional unit, such as a ROM unit, is not able 
to pass the value that has to be spilled firom Ihe register file, the pass unit is capable of doiug 
so. 

30 An embodiment of the invention is characterized in that the communication 

network comprises a partially connected communication network. An advantage of this 
embodiment is that it reduces the overiiead of a fiilly connected network. 
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An embodiment of the invention is characterized in that the pass unit is part of 
the associated functional unit. An advantage of this embodiment is that no separate pass unit 
is required, saving additional silicon area and communication coimections. 

BRIEF DESCRIPTION OF THE DRAWING 

The features of the described embodiments will be further elucidated and 
described with reference to the drawing: 

The single figure is a schematic diagram of a VLIW processor in accordance 
with an embodiment oflBepresent invention. 



DESCRIPTION OF PREFERRED EMBODIMENTS 

Referring to the figure, a schematic block diagram illustrates a VUW 
processor comprising a plurality of functional units including functional units 1 and 3. The 
processor has a distributed register file 4 including register file segments 5, 7 and 9. Hie 
15 functional unit 1 can read from register file segment 5. The functional unit 3 can read from 
the register file segments 7 and 9. 

The processor includes a communication unit 11 for communication with a 
memory. The latter is not shown in this figure. The processor also includes a communication 
network 13, e.g. a bus, that couples the functional units 1 and 3, and the register file segments 
20 5,7and9. 

Furthemiore, the processor comprises a spilling device 15. The spilling device 
15 and the register file segments 5, 7 and 9 are coupled via the communication network 13. 
The spilling device 15 and the functional units 1 and 3 are also coupled via the 
communication networic 13. The spiUing device 15 allows transfenring values fit)m flie 

25 register file segments 5, 7 and 9 to the conimunication unit 11, via the functional units 1 and 
3. Furthermore, the spilling device IS can restore values by transferring them from the 
communication unit 11 to the register file segments 5, 7 and 9. 

Preferably, the spilling device 15 comprises a spill register file 17 and a spill 
unit 19. The spill unit 19 can read from the spill register file 17. An advantage of this 

30 embodiment is that it prevents that the speed of transferring values by the spilling device 15 
to the communication unit 11 limits the clock frequency of the processor. In some 
embodiments, values can be temporarily stored in the spiU register file 17, without passing 
them to the communication unit 11. From the spill register file 17, these values can be 
transferred to the register file segments 5, 7 and 9 by the spill rniit 19. An advantage of this 
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embodiment is that is prevents the time consuming actions of writing to and restoring values 
from the communication unit 11. 

Li an advantageous embodiment, the processor also has a pass unit 21. The 
pass unit 21 can read from the register file segment 5, and the pass \mit 21 and the spilling 
5 device 15 are coupled via the communication network 13. The pass unit 21 allows passing 
ttie value that has to be spilled from the register file segment 5, in case the frmctional unit 1 is 
not capable of doing so. In some embodiments multiple pass units will be present, in other 
embodiments none will be present. A pass unit can be associated with a frmctional unit if that 
frmctional unit, e.g. a ROM unit, is not capable of passing a value from a register file 
10 segment. 

In the embodiment shown in this figure, the pass unit 21 is a separate unit 
associated with frmctional unit 1. In some embodiments, the pass unit 21 can be part of the 
associated frmctional unit 1 by adding an additional operation to this frmctional unit that 
allows to pass a value from a register file segment. 

IS The frmctional units can be coupled to one register file segment, as in case of 

the frmctional unit 1, or to multiple register file segments, as in case of the frmctional xmit 3. 
The register file segments can be coupled to one frmctional unit or to multiple frmctional 
units. The degree of coupling between the register file segpients and the frmctional units can 
depend on the type of operations tiiat the frmctional imit has to perform. 

20 The commimication network 13 couples the frmctional units 1 and 3, and the 

register file segments 5, 7 and 9. The embodiment shown in this figure has a frilly connected 
communication network, which means that every frmctional unit 1 and 3 is coupled to every 
register file segment 5, 7 and 9. In some embodiments the communication network 13 is 
partially connected, so not every frmctional unit 1 and 3 is coupled to every register file 

25 segment 5, 7 and 9. An advantage of a partially connected network compared to a frilly 
connected network is that it is less expulsive in terms of silicon area and it allows for a 
higher clock frequency of the processor. 

The frmctional unit 1 and 3 as well as the pass unit 21 are coupled to the 
spilling device 15. In the embodiment shown in this figure, these couplings are part of the 

30 communication network 13. In some embodiments, one or more of these couplings may be 
sq)arate. 

The spilling device 15 is coupled to the register file segments 5, 7 and 9. In the 
embodiment shown in this figure, these coiq)lings are part of the communication network 13. 
In some embodiments, one or more of these couplings may be separate. 
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It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the appended claims. In the claims, any 
reference signs placed between parentheses shall not be construed as lindting the claim. The 
5 word "comprising" does not exclude the presence of elements or steps other than those listed 
in a claim. The word "a" or "an" preceding an element does not exclude the presence of a 
plurality of such elements. In the device claim enumerating several means» several of these 
means can be embodied by one and the same item of hardware. The mere fact that certain 
measures are recited in mutually different dependent claims does not indicate that a 
10 combination of these measures cannot be used to advantage. 



